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Abstract We have constructed a new, fast, robust and reli- 
able pipeline to detect variable stars from the ROTSE-IIId 
archival data. Turkish share of ROTSE-III archive contains 
approximately one million objects from a large field of view 
(1.85°) and it considerably covers a large portion of north- 
ern sky (8 > -25°). The unfiltered ROTSE-III magnitude 
of the objects ranges from 7.7 to 16.9. The main stages of 
the new pipeline are as follows: Source extraction, astrom- 
etry of the objects, light curve generation and inhomoge- 
neous ensemble photometry. A high performance comput- 
ing (HPC) algorithm has also been implemented into the 
pipeline where we had a good performance even on a per- 
sonal computer. Running the algorithms of the pipeline on a 
cluster decreases analysis time significantly from weeks to 
hours. The pipeline is especially tested against long period 
variable stars with periods of a few hundred days (e.g Mira 
and SR) and variables having periods starting from a few 
days to a few hundred days were detected. 
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1 Introduction 



Robo tic Optical Transient Search Experiment (ROTSE - Akerlo fet all 
2003) is a network of telescopes located all around the worlcf^. 
The primary goal of the ROTSE-IIIproject is to observe 
Gamma-Ray Bursts (GRB) in optical light. Each ROTSE- 
Illtelescope consists of 45 cm with a wide field of view 
(1.85°). The telescopes were built to respond rapidly to GRBs 
(<10 s) which are triggered by satellites such as Swift, In- 
tegral and HETE. The ROTSE-IIIsystem runs unattended 
with fully automated observ ation, data acquisition and anal- 
ysis (see Rvkoff etal d2005h for the pipeline). 

The ROTSE-IIIcollaborationuses 70% of each ROTSE- 
Illtelescope's observation time. The rest of the time is allo- 
cated for discretion by the local organization. The ROTSE- 
Illdtelescope is located at TUBiTAKNational Observatory 
(TUGjI, Bakirhtepe, Antalya, Turkey. Scheduled observa- 
tions were started in May 2004 and they were distributed 
among the Turkish astronomers by TUG. In this work, all of 
the public Turkish observations have been used. Some of ar- 
ticles which made of using the ROTSE- II Iddata of Turkish 



share are as follows 



Bavkal et all J2005h . iBilir eta3 d2006h 



and lKiziloglu et all 12009). 

The aim of this work is to detect variability as well as 
finding new variables using the ROTSE-IIIdarchival data. 
To achieve this main goal a multistage algorithm were devel- 
oped and then they are crafted into a series of routines: the 
pipeline. In section l2~Tl ROTSE-IIIdobservations and struc- 
ture of the data, and in section l2T2l the summary of ROTSE- 
Illdpipeline are given. The data handling, our pipeline and 
its structure are given in section [3] The main stages of the 
pipeline are as follows: source extraction ( 93.2) . astrometry 
of field stars ( jj3.3b . light curve generation ( jj3.4) and inho- 



1 jhttp : //rotse . net/ information /wo rid/ | 
2 TUG: [http : / /www, tug . tubitak . gov . tr / 1 
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Table 1 Magnitude limits of ROTSE-IIIdexposures. 
Exposure Time Saturation Magnitude Limiting Magnitude 



5 s 
20 s 
60s 



7™7 
9™0 
lOTO 



15"?6 
16T2 
16™9 



mogeneous ensemble photometry (^33J. We conclude our 
work with results and some suggestions for future work. 



2 ROTSE Data 

2.1 ROTSE-IIIdobservations and structure of the data 

The ROTSE-IIIdhas an 45 cm primary mirror and is outfit- 
ted with a 2kx2k TE cooled, CCD camera with 3.3 "/pixels, 
making a 1.85 field of view. QE (Quantum Efficiency) of 
the CCD pe aks at 550 nm. The telescope and CCD were 
described in lAkerlof et all (12003b . 

The CCD observations have been carried out in three 
different exposure times: 5 (~ 54%), 20 and 60 seconds. 
There are two important magnitude limits for these expo- 
sures given in Table[T] saturation (mean of maximum bright- 
ness of stars having no saturated pixels in their Full Width at 
Half Maximum - FWHM) and limiting (mean of the faintest 
star's magnitudes). 

The unfiltered ROTSE-IIImagnitude of the objects, de- 
pending on the exposure times, ranges from 7.7 to 16.9. 
Another image quality check was the FWHM of the point 
spread function on an image which ranges between 2 to 25 
pixels and on the average it is taken as 5 pixels. The Turk- 
ish share of ROTSE-IIIdarchive covers 2,210 deg 2 which is 
3.4% of the whole sky. 

The journal of observations used in the pipeline ranges 
between May 2004 and June 2010 and it consists of 234,764 
frames from 645 different pointings. The size of the archived 
data from the Turkish share (regardless of the pointings) 
since 2004 is approximately 2 TiB. 

266 pointings were observed less than 100 times and 
they were not included in the pipeline. This limiting value 
is an arbitrary choice. However, to have statistically reliable 
data sets we had to implement this minimum lower limit. 
This numbe r were also used in other large volume ROTSE- 
Illanalyses (IWozniakil2004l) . 



2.2 The summary of ROTSE-IIIdpipeline 

The data acquisition system is constructed on top of several 
daemons (e.g. weather, clamshell, camera). The telescope is 
operated under two modes: alarmed and scheduled. The lat- 
ter mode is used in the entire Turkish share and thus, in our 
pipeline. The ROTSE-IIItelescopes have a well designed 



data reduction pipeline allowing the near real-time process- 
ing of the CCD images taken by the entire ROTSE-IIInetwork. 
Regardless of types of observation carried out on the tele- 
scope, all images are fed into the ROTSE-IIIdpipeline. Fur- 
thermore, CCD images of all observations (taken with ei- 
ther alarmed or scheduled mode) were automatically pro- 
cessed (bias subtracted, flat fielded and fringe corrected) im- 
mediately after the frame has been do wnload to the disk 
(Rvko ff et alll2005l) . SExtractorsoftware dBertin and Arnouts , 
1996h is then used to detect objects, to measure centroid 
positions and to determine instrumental magnitudes (using 
5 pixels aperture). In addition to centroid positions, round- 
ness and sharpness values are also used to eliminate non-star 
like objects. Instrumental magnitudes of each object in the 
frame is calibrated by comparing a ll the field sta rs against 
the "USNO-A2.0 R-band catalog" jMonetlfl998l) (because 
the R filter is the nearest to the QE maximum) to obtain 
ROTSE-IIImagnitudes. By simply using triangulation method 
from the USNO catalog each object's coordinate and in- 
strumental magnitude are calibrated at the same time. The 
pipeline finally outputs files of calibrated object catalogs 
which are tabulated data in the form of binary FITS tables. 
The algorithm of the pipeline were coded in IDL and sche- 
matic representation is shown in FigureQ](blue colored blocks) 
Thus, the frame (and each detected object in the frame) can 
now be used in light curve analysis using object's a, 8, mag- 
nitude and error in magnitude. 



3 The New Pipeline 

3.1 Data Handling 

Input data for the pipeline can either be a calibrated object 
file or any corrected image file of the ROTSE-IIId. This is 
required if the frame is not valid i.e. the WCS (World Coor- 
dinate System) headers were wrong or there were recorded 
bad weather conditions, or technical problems. Otherwise 
the frame is valid and it enters to the stage-C of the pipeline. 
Thus, using these filtering methods, at least 80% of the archive 
was handled with the pipeline. The algorithm of the pipeline 
shown in Figure Q] (yellow colored blocks). 

It seems that the first two stages of our pipeline dupli- 
cates the ROTSE-IIIdpipeline (see FigureQ}. However, since 
(a) SCAMP has been used in astrometry calibration (see 
93. 3t and (b) the parameter set that we use in our pipeline 
was not produced by the ROTSE-IIId, we had to introduce 
our own stages in the pipeline. 
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Fig. 1 A combined flow chart view of ROTSE-IIId (blue colored) and 
our new (yellow colored) pipelines. 



3.2 Stage- A: Source Extraction 

ROTSE-IIIdcorrected images are used as INPUT data in 
this stage. A catalog of stars with their instrumental mag- 
nitude and frame coordinates is produced as an OUTPUT. 

SExtractorcode ( Bertin and Arnoutsi 1996 ) is used to iden- 
tify (and to perform an aperture photometry) the stars in the 
frame. Four types of magnitudes are calculated by SExtrac- 
torwhere only one of them is used in the pipeline, namely 
MAG_APER (magnitude from apertur e). A class ical value 



of 5 pixels ROTSE-IIIdaperture (e.g. lKiziloglu et allll995l) 
was used in the SExtractorsettings. The output of SExtrac- 
torconsists of instrumental magnitude, frame coordinates and 
many statistical calculations for each star. By using the SEx- 



tractor's statistical output we have improved the reliabil- 
ity of magnitudes calculated which wasn't possible using 
ROTSE-IIIdpipeline; it contained only a few columns of in- 
formation. 

As an example; for the ROTSE-IIIdpointing of 0006+4305 
there were 12,0 37 sources in "US NO-B 1 R" catalog (here- 
after USNO-B; lMonet et allliooll) for m R < 1 8™0. When the 
SExtractoris applied to this pointing, depending on atmo- 
spheric conditions, number of sources varied between 279 
and 11,399. Thus, the stage misses only 5% of the USNO-B 
sources. 



3.3 Stage-B: Astrometry of Field Stars 

The output of stage-A is used as an INPUT. Similar to the 
ROTSE-IIIdpipeline, the calibrated object catalogs are cal- 
culated as an OU TPUT 

The SCAMP dBertiii 12006.) software package is used to 
map USNO-B stars with the inputted field star coordinates 
so that a transformation matrix can be calculated. Instead of 
USNO-B catalog, GSC, UCAC or 2MASS catalogs could 
also be used. 

SCAMP has been improved for CCD frames taken from 
large FOV. The calculated astrometrical error of the ROTSE- 
Illdis approximately l-2"for its FOV (less than its pixel size). 
Therefore, SCAMP seems to be a suitable tool for astromet- 
ric calibrations in our pipeline. 

The pointing accuracy (o r error) of the ROTS E-IIIdtelescope 
is given as approximately I' dAkerlof et a 1 120031) . 

The mean of differences between "targeted frame cen- 
ters" and "frame centers recorded in frame headers" gives 
the pointing accuracy of ROTSE-IIId. These values are ap- 
proximately A a = 5.1' mdAd = _L0'. Since unaligned frames 
within a few arcmin ( Bertin , 20061) can be handled with 
SCAMP, this error doesn't effec t the resultant astrometry. 

Due to the telescope optics ( Giicsav , 20 id) , the best as- 
trometric alignment of the frames were done within the cen- 
tral 1.79°diameter of the whole ROTSE-IIIdframe. Thus, 
stars falling only into this region were used (see Fig. [2]). 

Since, SCAMP cannot handle frames having excessive 
image deformations (e.g.bad focusing, weather effects, tele- 
scope instability) they are automatically discarded and no 
output is produced. This feature of SCAMP is also used as a 
"filtering" algorithm for faulty frames. 



3.4 Stage-C: Light Curve Generation 

The output of stage-B or the output of ROTSE-IIIdpipeline 
is used as an INPUT. 

Note that, our input contains a list of equatorial coordi- 
nates and instrumental magnitudes of sources, namely it is 
called "calibrated object catalog" (see Fig. [TJ grey colored 
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Fig. 2 A sample CCD image from ROTSE-IIIdarchive overlayed with 
the chosen ROTSE-IIIdFOV (i.e.l.79°). 



box at the middle). Each inputted star's light curve (instru- 
mental magnitude vs. JD) is created as an OUTPUT. 

In this stage, the aim is to follow each star's equatorial 
coordinates in each frame throughout the whole time span 
and then collect the corresponding instrumental magnitudes 
at that coordinate. As in the stage-B, coordinates of USNO- 
B field stars are used to match calibrated catalogs. In order 
to do this a fixed circular aperture (namely "matching aper- 
ture"; hereafter <p m ) has to be chosen to scan through each 
catalog. The tj) m can be neither a low value (which increases 
the chance to miss the target star) nor a high value (which 
increases the chance to multi-match the target stars). There- 
fore, § m was chosen according to ROTSE-IIIdpixel scale 
(namely 3.3 "/pixel) and to achieve a standard Gaussian pho- 
ton distribution, 3 pixels range were taken. Thus, lO' was 
chosen as an optimum m value (4> m ) which will be used 
throughout the pipeline (Gucsav , 2010h . 



As a side effect of the matching algorithm, especially 
in crowded fields, there might still be multi-matches in the 
field. In such cases, the nearest star in the calibrated catalog 
to the USNO-B star was chosen as the match. The pipeline 
can be fine-tuned manually for some of the overcrowded 
fields by decreasing <p m to 6"to be able to decrease multi- 
matches. In a future version, this fine-tuning can be inte- 
grated into the pipeline by marking each field with a crowd- 
ness value which would make it possible to apply different 
(j) m values to each field whenever it is necessary. 



3.5 Stage-D: Inhomogeneous Ensemble Photometry 

The output of stage-C is used as an INPUT. Cleaned light 
curves are created as an OUTPUT. 

In classical photometry, a single comparis on star is used 

effectively for many year s by astronomers (see Henden and Kaitchuckl 
ll990tlYoung et all 1991 ). However, in recent years differen- 
tial photometry with "many comparison stars" has increased 
both reliability and accuracy of the light curves. Ensem- 
ble photometry is also a new kind of differential photom- 
etry which works with inhomogeneo us CCD data sets (e.g. 
Honevcutt , 1992 ; Saesen et al , 2O10h . We have also imple- 
mented the ensemble photometry in the pipeline to decrease 
statistical errors in instrumental magnitude of stars. For ex- 
ample, light curve of a 10?0 star was found to be almost con- 
stant. According to SExtractor, mean of RMS flux errors of 
the PSF fitting to the star was calculated to be 0'?006. How- 
ever, with the implemented ensemble photometry, we have 
reached to a scatter value of I ?002. Note that, this level of 
noise is also related to the other frame statistics (see scatter- 
error graph of the pointing that this star is located - Figure[3]); 
as the magnitude of the source decreases, scatter increases 
and therefore goodness of the PSF fitting decreases. As can 
be seen in the figure, while scatter of a lCPO magnitude star 
is around 2 mmag, it increases to 130-200 mmag at 16™0. 

The main aim of the technique is to find non-varying 
stars (i.e. reference stars) throughout the time span of the 
light curves. The objects that have non-star like shapes are 
all ignored by using SExtractor's roundness and sharpness 
analysis (see 32.21 ). Main criteria in choosing the reference 
stars are (1) to choose the stars far enough from edges of 
the frame, (2) to have the star isolated from the others, (3) 
to have no flags set in the SExtractor's output, (4) to have 
roundness value close to zero, and (5) to have sharpness 
value close to one. In order to not to foul the technique, light 
curves of mostly non-variable stars have to be used. There- 
fore, rough variability detection has to be applied to the light 
curves; namely 'scatter-and-error' relation of the light curve 
has to be calculated. According to the result of this relation 
the star can now be accounted as a reference. Afterward, a 
mean reference level is calculated from all chosen reference 
stars and it is used to calculate the relative (or differential) 
magnitude of other stars. 

By applying the technique second time (starting from the 
scatter-and-error relation), reliability of reference stars is in- 
creased: stars of the first run is inputted as the star list in the 
second run. With this run, variation of reference stars is de- 
creased and therefore the accuracy of the final cleaned light 
curves is increased. As an example, a reference star's light 
curves before and after ensemble photometry applied are 
given in Fig. [4] As seen in the figure, after ensemble photom- 
etry (stage-D) applied, scatter of the light curve (i.e.sigma) 
decreased. 
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Fig. 3 Mean magnitude (in magnitude) versus mean of RMS flux errors (in mmag) of sources in an example pointing which contains 31,984 
sources is given (see 33. 5t . The bright end of the graph is extended at upper left as an inset with the same units. Similarly, dim end of the graph is 
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This technique has the advantage of removing frame to 
frame background variations due to Moon light and unstable 
weather conditions. As a disadvantage of this technique, the 
absolute magnitude of stars cannot be calculated. A sample 
of the final light curve of a variable from our pipeline is 
given in Fig. [5] 

3.6 The Pipeline Structure 



In order to balance the CPU load between cores the same 
amount of star's data (both light curves and star's informa- 
tion) are stored. 

The prototype of the pipeline was started with IDL. How- 
ever, to be able to have a robust and fast pipeline, they are 
converted and parallelized into C-language. With the proto- 
type, it took approximately one week to create light curves 
of a single medium crowded pointing with a 4 cored PC. 
This duration decreased to about 2 hours with the pipeline. 



The new pipeline works on a small cluster called Infini- 
tus which made use of 8 different computers with 36 cores 
each having 8 GHz CPU speed. GNU/Linux operating sys- 
tem and Lustre 1.6.7.4 file system is used on all computers. 
Computers are interconnected with a gigabyte ethernet. The 
pipeline is mainly written in C language. Parallel algorithms 
have been used in every step of the pipeline which made use 
of MPICH20 library. 

In order to increase speed of the process a parallel file 
reading method is used in the pipeline. The files of each 
pointing (all the calibrated catalog stars in the entire time 
span) is automatically separated into all processors so that 
each pointing stays in the memory up to end of the stage-C. 

3 jhttp : / / www .mcs . anl . gov/ research/pro ject s/mpich2 



4 Conclusion 

We have constructed a new, fast, robust and reliable pipeline 
to detect variable stars from the ROTSE-IIIdarchival data. 
The main stages of the pipeline were as follows: Object 
identification, astrometry of the objects, light curve genera- 
tion and inhomogeneous ensemble photometry. For the first 
time in the ROTSE-IIIarchive, a high performance comput- 
ing (HPC) algorithm has been implemented into the pipeline. 
Depending on the data quality, either corrected CCD images 
(stage-A) or the calibrated object catalogs produced by the 
ROTSE-IIIdpipeline (stage-C) could be used in the pipeline. 
The last stage of the pipeline (stage-D), namely inhomoge- 
Heous ensemble photometry (implemented to the ROTSE- 
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Fig. 4 The light curves of a reference star before (top panel) and af- 
ter (bottom panel) ensemble photometry is applied. The sigma values 
represent the scattering of the curve. 



Illarchive for the first time), gives relative light variations of 
the measured stars with high precision. Within 645 pointings 
(see section [2TTT > of observations in 2004-2010, light curve 
of approximately one million stars are produced. 

Work on identifying new variables and searching for pe- 
riods of known variables are still in progress. The following 
statistical tests to detect variable stars have been applied to 
the light cur ves: Scatter-Error A nalysis (commonly used), 
Abbe Index dSaesen et all 2010h and Analysis of Variance 
( Wozniak, 2004). For period hunting the following tech niques 



have been used: PDM (Phase Dispersion Minimizat ion : I S tellin g 
1978 ), Lomb-Scargle periodogram dLombl 1 19761 IScarglei 
1982) and SIGnificance SPECtrum (SigSpec: lReegenil2007l) . 
A few thousands of light curves have passed from all three 
statistical tests mentioned above. These light curves most 
probably are un-identified variable stars. 

Light curves of approximately ten thousands stars from 
4 pointings are converted into phase-magnitude table. These 
phase graphs were then visually inspected for repeating pat- 
terns. Among these reduced list, 152 stars show variability 
and according to SIMBAD dGucsavl l2010h . 20 of them are 
unknown variables which are not classified yet. 

As a result of an early version of the pipeline, Mira and 
SR stars known in SIMBAD have been examined. New ac- 
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Fig. 5 Light curves (differential magnitudes vs. phase) of three se- 
lected variables which are automaticly produced by our pipeline. They 
all have different mean magnitudes and their mean statistical errors are 
also plotted on each point (except the top one; which is too small to be 
ed together). From top to bottom panel, the mean magnitudes are 
r 12T4 and 15"?5; and their mean statistical errors are 1 mmag, 10 



mmag and 40 mmag, respectively. 



tual periods of 78 Mira variables have b een identified and 
18 of them have a period for the first time (fYesilvapra k et all 
201 II) with early version of the pipeline. Also, approximately 
300 SR stars are und er investigation an d 15 of them were ex- 
amined in details by Dikicioglu d201lh . Tabulated values of 
minimum and maximum values of both amplitude and pe- 
riod are given in Table [2] 

As a by product of the pipeline, deformation and/or de- 
generation of frames could also be detected; e.g. background 
variations due to Moon light and unstable weather condi- 
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Table 2 Quantitative limits of light curves created by the pipeline. 

Amplitude Period (days) Reference 
Min. OTl 0.17±0.02 Gucsav (2010) 
Max. 5'?9 720.0±43.1 Yesilyaprak et al (201 1) 



tions. A surprising result from degradation of the ROTSE- 
Illdmirror was also noticed: the decrease in the number of 
detected stars is correlated with one magnitude decrease in 
the limiting magnitude. 

The main aim of the pipeline usage is to find new vari- 
ables, periods of known variables, and to classify these vari- 
ables using the above period hunting techniques. 

The pipeline can easily be adapted to other ROTSE- 
Illtelescopes which will increase the sky coverage and num- 
ber of detected unknown variables, transients etc. 
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